Formant model estimation and transformation for voice morphing
نویسندگان
چکیده
In this paper we consider the estimation and mapping of timevarying formant model parameters and orders for voice transformation. The model order is the number of perceptually significant formant trajectories estimated from an analysis of the poles of “over-modelled’’ linear prediction models of the source and target speech. A 2-D HMM with NF left–to-right states across frequency and M states across time is used to classify formant observations into NF sequential formant clusters. A formant-based non-uniform frequency warping method is proposed for voice transformation. In this method speech spectrum is divided into NF+1 formant bands. A transformation is estimated for each formant band of a phoneme model. Multi-mixture Gaussians are used to model the distribution of parameters in each formant band. The voice mapping yields perceptually high quality results.
منابع مشابه
Probability models of formant parameters for voice conversion
This paper explores the estimation and mapping of probability models of formant parameter vectors for voice conversion. The formant parameter vectors consist of the frequency, bandwidth and intensity of resonance at formants. Formant parameters are derived from the coefficients of a linear prediction (LP) model of speech. The formant distributions are modelled with phonemedependent two-dimensio...
متن کاملVocal Effort Modification for Singing Synthesis
Vocal effort modification of natural speech is an asset to various applications, in particular, for adding flexibility to concatenative voice synthesis systems. Although decreasing vocal effort is not particularly difficult, increasing vocal effort is a challenging issue. It requires the generation of artificial harmonics in the voice spectrum, along with transformation of the spectral envelope...
متن کاملVoice Morphing Using the Generative Topographic Mapping
In this paper we address the problem of Voice Morphing. We attempt to transform the spectral characteristics of a source speakers speech signal so that the listener would believe that the speech was uttered by a target speaker. The voice morphing system transforms the spectral envelope as represented by a Linear Prediction model. The transformation is achieved by codebook mapping using the Gen...
متن کاملStudy on manipulation method of voice quality based on the vocal tract area function
This paper describes a new manipulation method of voice quality which is based on the STRAIGHT analysis-synthesis system. This method manipulates voice quality by changing the vocal tract area function calculated from the PARCOR coefficients. The PARCOR coefficients used in the proposed method is obtained from the auto-correlation function of the STRAIGHT spectrum. We have implemented a simple ...
متن کاملA comparative study of spectral transformation techniques for singing voice synthesis
Studies show that professional singing matches well the associated melody and typically exhibits spectra different from speech in resonance tuning and singing formant. Therefore, one of the important topics in speech-to-singing conversion is to characterize the spectral transformation between speech and singing. This paper extends two types of spectral transformation techniques, namely voice co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002